[57] ABSTRACT

An information aggregation and synthesization system and 
process. The present invention provides aggregation and 
packaging of structured or unstructured information from 
disparate sources such as those available on a network such 
as the Internet. A network compatible/addressable interface 
device is operated by a user. The network interface device 
communicates with local datastores or network accessible 
datastores via an addressing scheme such as Uniform 
Resource Locator addresses (URLs) utilized by the Internet. 
Data passing between the network interface device and the 
datastores is accessed, polled, and retrieved through an 
intermediary gateway system. Such aggregated information 
is then synthesized, customized, personalized and localized 
to meet the information resource requests specified by the 
user via the network interface device. 
INFORMATION AGGREGATION AND SYNTHESIZATION SYSTEM

CROSS REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part of U.S. patent
application Ser. No. 08/685,805 filed Jul. 24, 1996 now U.S.
Pat. No. 5,901,287, which is based on Provisional Application
No. 60/015,384 entitled INFORMATION AGGREGATION AND
SYNTHESIZATION SYSTEM, filed Apr. 1, 1996.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention is directed to an information
aggregation and synthesization system which connects with local
and network accessible datastores through an intermediary
gateway system.

2. Prior Art

Widespread use of personal computers, modems
(modulator/demodulator devices that enable data to be
transmitted) and data connections has allowed the growth of
computer networks. The Internet serves as an example of a
type of computer network, and indeed, is a large network of
networks, all inter-connected, wherein the processing
activity takes place in real time. The Internet offers mail, file
transfer, remote log in and other services. The World Wide
Web (WWW) is the fastest growing part of the Internet.

On the World Wide Web (WWW), a technology called
hypertext allows Internet addressable resources to be
connected, or linked, to one another.

In the past, certain, limited aspects, of the present
invention have been proposed, such as monitoring of computer
usage.

Lockwood (U.S. Pat. No. 5,309,355) provides a
computerized tool to augment sales and marketing capabilities of
travel agency personnel. The system creates and displays
customized sales presentations from (1) stored client
profiles; (2) travel agent assessment of client profiles; and (3)
computerized reservation system responses to client profiles.
Selected factors are analyzed by the operating program
based upon an organization hierarchy of specifications.

Lockwood differs from the present invention in:

1) Data sources: Lockwood uses content from both a
videodisk (static) and computerized reservation systems
(dynamic). The present invention is capable of deriving
content from totally dynamic sources on the World Wide
Web (including Internet and local datastores or caches
simulating a WWW component).

2) Client Profiles: Lockwood proposes that these be
input by a Travel Agent. In the present invention, profiles are
entered by the consumer (explicit) or collected through
analysis of online session activity (implicit).

3) Data Organization: Lockwood uses preindexed
videodisks. The present invention indexes prequalified WWW
sites, updating these as they change or as users expand their
WWW searches.

4) Programation: Lockwood places the entire index of
information in a PROM. This index is exercised by the
sequencer which displays a sales presentation. The present
invention stores indices in magnetic medium but retrieval
and presentation of the indexed information is executed
dynamically on premised upon user input.

Remillard (U.S. Pat. No. 5,404,393) discloses an
electronic device and method for monitoring television activity
and communicating the monitored activity to a facility and
initiating appropriate actions. A controller initiates an
automated configuration by acquiring configuration information.
A controller monitors television channel selection
information and assembles the monitored television information
into a user profile. An option includes capturing images or
text and forwarding to the user through a mail facility.

Remillard differs from the present invention in that it
suggests a device to access distant information through a
television set. The present invention utilizes network
addressable information resource and human interface elements
such as those used by the Internet, one of which may in fact
be attached to a TV. Remillard's invention (or that of others)
may be used as a means to acquire WWW information but
does not contemplate the present invention.

Levinson (U.S. Pat. No. 5,404,505) provides information
in a database which is tagged with indices to form an
hierarchical structure. Software having a set of subscriber
requests handling routines interacts with a data filter
subsystem. The data filter subsystem receives incoming data
stream and selects those packets that meet certain selection
criteria. A special smart caching routine is provided for
anticipating future requests by the user.

Levinson differs from the present invention:

1) Levinson proposes a satellite based information
retrieval system. This is based on fixed data sources
(Compuserve, Prodigy) being queried by a user on a
telephone line with the results being returned via a television
connection. The present invention uses a similar
infrastructure to return requested information to the user but our
process for identifying content that is relevant is software
agent based and retrieval of dynamic content is from the
WWW vs. data sources. The present invention can use
any means: for example, TV, Cable Modem, RF, ISDN,
Modem, fixed line (T-2, T-3 etc.).

2) Levinson would establish user inputted profiles for
"Automatic Data Retrieval". The present invention
supplements user provided profile information by constructing
implicit profile recognition patterns, based upon historical
search activity.

3) Levinson's invention does not specify any of the six
components proposed in the present invention.

Griffin et al. (U.S. Pat. No. 5,422,809) provides an
information storage and retrieval system for storing, referencing
and retrieving various travel information from a database. A
querying device queries the user for input used to define the
field for the travel destination desired. Statistical records are
produced which provide relevant information relating to
travel destinations using the system. Information is thus
provided which can be used to evaluate the popularity of
particular destinations.

Griffin et al. differs from the present invention in that it
discloses a kiosk system and the processes and subprocesses
for self service travel planning and reservations. While the
present invention provides similar capability using other
means, the six features of the present invention are not
disclosed in this patent.

Senda (U.S. Pat. No. 5,459,859) discloses an information
providing system using a communication network which
stores attribute/schedule information from each subscriber
and uses that information to match with other subscribers.

Senda differs from the present invention in that it is a
software based system for meeting a system while traveling.
It involves a best fit match between profiles. The present
invention also provides a "best fit" but between software
agents and data being viewed. Senda has both formatted
selection and source data inputted for a specific purpose (to 
meet someone). The present invention uses software agents 
to format selection data but the source data is unformatted 
from the WWW. 

Belove et al. (U.S. Pat. No. 5,491,820) discloses a storage 
transmission mechanism for retrievable items and may be 
used on the Internet. The system may include a filter on each 
client or on the server between the user and the Internet. 

Belove et al. differs from the present invention in that it 
is a client server object caching system. Except for the 
pruning mechanism that limits the information cached at the 
client side, there is no resemblance to the present invention. 

Accordingly, it is a principal object and purpose of the 
present invention to provide an information aggregation and 
synthesization process and system connecting a network 
operable device and a plurality of local or network acces- 
sible datastores wherein data passing there between is 
accessed, polled and retrieved through an intermediary gate- 
way system. 

SUMMARY OF THE INVENTION 

The present invention includes at least six different 
aspects or functional components which are related, all 
involving use of a computer accessible data network such as 
the Internet. While the individual aspects may be utilized 
together, they may also be used separately. 

The user initiates access to the system through a network 
addressable interface device (such as a personal computer, 
Internet Appliance, an interactive television or even a per- 
sonal digital assistant or smart telephone). The user is then 
connected to the information aggregation and synthesization 
system via a network service provider (most likely through 
the Internet or some variation). The user logs on to the 
system either by name, address, or with some pseudonym 
(or some combination). This allows the user's activity to be 
tracked and establishes a log of the user's activity during the 
current online experience (session). The user is also asked 
for explicit profile information concerning preferences. 
These preferences will be used to narrow the information 
retrieval and may be collected when the user first logs in or 
incrementally as the user asks for specific information. This 
profile information will be kept and updated as the indi- 
vidual user's preferences change. 

Once the user is logged in, the information aggregation 
and synthesization system will facilitate the user's access to 
local information or information distributed on a network 
(this network could be a local area network or a wide area 
network such as the Internet). All user access to information 
is through the system. 

This information is topically oriented (Germany travel, 
the Olympics, Spring Break or even new cars), composed of 
files and file references using the Hypertext Markup Lan- 
guage ("HTML") or similar tagged reference format that 
may be prescreened for relevance and appropriateness. 
Selected text can be "expanded" at any time to provide other 
information. These words are, thus, linked to other docu- 
ments. This information is indexed in this fashion in advance 
of the user's logging in. 

A gateway is provided into the WWW for shopping while 
retaining the user passing through the information aggrega- 
tion and synthesization system. A gateway is provided to 
poll, access and retrieve information from various locations. 
A filtering process is provided and the resulting information 
is returned to the requested party. 

The user is presented with a variety of search, display and 
output options. The search options include: 1) Search using 
key words or combinations; 2) Use of complex software text
search agents that have been predefined by the information
aggregation and synthesization system site operators. These
agents take advantage of the expansive subject matter
expertise in understanding which search parameters will best
serve the user's search needs; 3) Use of search patterns and 
agents from this user's previous sessions, perhaps expanded 
by available specials and promotions; 4) Natural Language 
Query; and 5) Some combination of 1), 2), 3) and 4). 

The user selects information to be viewed from the results
of the search. This information is retrieved from its source 
and presented to the user in the manner and at the time 
requested. The available display options include but are not 
limited to: display on the user's network capable device,
personal TV channel, customized Internet page, custom
CD-ROM, electronic mail, mobile devices (Personal Digital
Assistants, telephones and pagers) and facsimile. Information
retrieved and displayed can be text, still pictures, videos,
interactive multimedia, audio and geographic data.

In certain situations, data from the datastores destined for
the user is converted prior to delivery to the user. The data
stream returned to the user may be modified to fit the
bandwidth, character set and display limitations of the
network and may be modified to meet the limitations of the
user interface device.

Along with displays, including those for data entry,
searches, search results and information retrieval, the user will
be presented with advertisements and/or coupons based on
criteria entered by advertisers. These criteria may take the
form of simple logic linking an ad/coupon with a display, or
be derived from complex software text search agents that
analyze one or more of the following: the user's looking
pattern, the user's psychographic profile, the user's personal
profile, and the availability of the advertiser's/couponer's goods
or services at the instant in time that the criteria are being
exercised. The placement of the ad/coupon will be logged
along with user profile information and provided to the
advertiser/couponer in some form of report.

During a user session or when a user completes a session,
the user's looking activity is analyzed for patterns, preferences
and trends and the profile annotated or updated so that
when they next use the information aggregation and
synthesization system, the nominated searches will be
customized to their individual desires.

The six aspects of the information aggregation and syn- 
thesization system are: 

I. URL Munging 

The World Wide Web ("WWW") is characterized by
computer (user) connection through an Internet Service
Provider to any WWW address or site. Hence, use of the
WWW is like placing individual telephone calls to many
merchants, trying to compare products and services. URL
Munging is the process that allows the goods and services of
many merchants to be displayed through a single virtual
shopping center.

This involves encapsulating and indexing the content of
various merchants as well as modifying parts of the internal
structure, repurposing and redirecting it to be integrated into
the information aggregation and synthesization process.
This allows content from and access to multiple merchants
to be aggregated, synthesized and accessed at a single
WWW site.
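
One way to picture the redirection step is a link rewriter: each link in a
retrieved merchant page is rewritten so that following it sends the request
back through the aggregation site. The sketch below is only an illustration
under stated assumptions, not the implementation described in this
specification. The gateway address `http://gateway.example/fetch`, the
`target` parameter and all function names are hypothetical.

```python
import re
from urllib.parse import urljoin, quote, urlparse, parse_qs

# Hypothetical gateway address; the specification does not name one.
GATEWAY = "http://gateway.example/fetch"


def munge_url(page_url, link):
    """Rewrite one link so the request passes through the gateway."""
    absolute = urljoin(page_url, link)  # resolve relative links first
    return GATEWAY + "?target=" + quote(absolute, safe="")


def munge_page(page_url, html):
    """Rewrite every href="..." attribute found in a merchant page."""
    def replace(match):
        return 'href="' + munge_url(page_url, match.group(1)) + '"'
    return re.sub(r'href="([^"]*)"', replace, html)


def unmunge(gateway_url):
    """Recover the merchant address the gateway should fetch."""
    return parse_qs(urlparse(gateway_url).query)["target"][0]
```

Because every rewritten link carries the original merchant address, the
gateway can retrieve the merchant's content while the user remains within a
single site.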

II. WWW CD-ROM

World Wide Web ("WWW") access from homes is often
constrained by the lack of sufficient data communications
bandwidth within a typical residential infrastructure (WWW
information may be accessed through the Internet WWW, a
local Internet WWW, or a local datastore or cache simulating
a WWW component).

The Internet user will select World Wide Web (WWW) 
content for retrieval using a search engine to return selected
WWW references. The user will then select certain of these 
references to be included in a custom CD which will be 
burned or recorded onto a CD and then sent by express 
delivery to the user. 

III. Software Agent Advertising Insertion

Currently, advertisements in WWW pages are tightly tied
to each page, or are inserted based on keywords or on a
psychographic profile of the user.

Certain criteria will be entered which delineate a pattern
that is requested to be monitored. When this pattern is seen
(or is closely matched) in the user's WWW activity, the
insertion mechanism is activated. If a certain web page is
requested, the present invention will display a particular
advertisement. The ad will be inserted based on the content
of the existing web page being read. An analysis of the text
stream of the user's interactive session will be performed
on-line. For instance, if the user accesses web pages for
Holiday Inns on the West Coast, the insertion mechanism
could be established to automatically insert ads for Hilton
Inns on the West Coast.

IV. Automated Profile Generation 

Presently, user profiles are collected based on explicit
entry by the user, and extraction from demographic data 
collected from a variety of sources. 

In the present invention, the searching patterns of the user
on the Internet are monitored. A set of software text agent
profiles is developed and may be integrated with explicitly
collected profile information. The automated profile
generation will have both explicit profile information gathering and
implicit profile information gathering capabilities.

As the user uses the information aggregation and
synthesization system, the pattern of information being viewed is
analyzed. During a user's session, advanced text analysis
tools are used in real-time to understand the interests of the
user by synthesis of the text stream of pages looked at. This
synthesis is used as input to a statistical correlation with
similar interests of a larger population. The results of this
correlation are used to predict the extended interests of the
user. These are matched using intelligent software text
agents and a variety of reasoning techniques. The user is
presented with search ideas as well as promotions and
specials from suppliers based on these searching patterns.
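
The implicit profiling step can be sketched as term counting followed by a
similarity comparison against interest groups drawn from a larger population.
This is a minimal sketch under assumptions: the specification does not say
which text analysis or correlation method is used, so cosine similarity over
word counts stands in for it here, and the stopword list, group names and
function names are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "and", "of", "in", "to", "for", "on"}


def session_profile(pages):
    """Count the terms in the text of every page viewed in a session."""
    counts = Counter()
    for text in pages:
        for word in re.findall(r"[a-z]+", text.lower()):
            if word not in STOPWORDS:
                counts[word] += 1
    return counts


def cosine(a, b):
    """Cosine similarity between two term-count profiles."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def closest_groups(profile, population, top=1):
    """Rank population interest groups by similarity to the session."""
    ranked = sorted(population,
                    key=lambda name: cosine(profile, population[name]),
                    reverse=True)
    return ranked[:top]
```

The best matching groups could then suggest search ideas or promotions the
user has not yet asked for.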

V. Automated Lead Generation

Currently, leads are generated by recording users' WWW site
selection. (For example, users visiting a "Chicago" information
site would be "Chicago" leads.)

In the present invention, the user's WWW viewing patterns
are recorded. These and optionally the user's profile are
matched against software text agents entered by a supplier.
When these agents match a pattern/profile, either exactly or
approximately, the supplier is notified.

VI. Software Agent Unmet Needs Generation

Currently, there is no on-line immediately accessible
system to analyze unmet needs of Internet users.

In the present invention, records will be maintained from
user usage of the Internet on what consumer queries are
unmet by the WWW content retrieved. The invention will
intuitively construct a profile from user inputted data. This
will be done by recognizing unanswered queries and/or user
initiated requests. From this, a profile will be developed to
identify new markets. As an example, if one hundred people
inquire about snorkeling off the coast of Texas, this
information could be sold to a tour provider who could not only
prepare a travel package but sell the leads to a company.
Thus, the system will be able to gather "negative" leads.
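
The gathering of "negative" leads can be pictured as counting the queries
that returned nothing. The following is a minimal sketch, assuming a query
log of (query text, number of results) pairs. The log format, the
`min_count` cutoff and the function name are assumptions made for the
example.

```python
from collections import Counter


def unmet_needs(query_log, min_count=2):
    """Return (query, count) pairs for queries that found no content.

    query_log is an iterable of (query_text, number_of_results) pairs.
    Only queries that went unanswered at least min_count times are kept,
    most frequent first.
    """
    misses = Counter(query.strip().lower()
                     for query, hits in query_log if hits == 0)
    return [(query, n) for query, n in misses.most_common()
            if n >= min_count]
```

A report built this way would show a supplier which requests recur often
enough to suggest a new market.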

In the course of a session, the user may desire information 
not yet available. This information could be in the form of 
a product, a service or an event. The user then can establish 
a persistent (stays around after the user's session is over) 
complex software text search agent to monitor future infor- 
mation additions to the System and alert the user through a 
variety of means (facsimile, electronic mail, text page, 
voice, pager) that the information that was requested is 
available or in some instances, provide the information 
directly. The set of persistent agents will also be analyzed by 
the information aggregation and synthesization system 
operators and provided to potential suppliers who would in 
turn develop new product offerings which would be added to 
the information aggregation and synthesization system 
sources. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in 
and constitute a part of the specification, illustrate presently 
preferred embodiments of the invention and, together with 
the preceding general description and the following detailed 
description, explain the principles of the invention. In the 
drawings: 

FIG. 1 illustrates an interface of the present system with 
a user access system and various data sources; 

FIG. 1A illustrates a modified arrangement of the interface
of the present system with alternate user access systems
and alternate network interface devices;

FIG. 1B illustrates a limited bandwidth limited character
set subsystem consistent with the present invention;

FIG. 2 illustrates several datastore categories and an I/O 
system consistent with the present invention; 

FIG. 3 illustrates dialog management and agent datastore 
categories consistent with the present invention; 

FIG. 4 illustrates operation systems categories consistent 
with the present invention; 

FIG. 5 illustrates a flow diagram for a WWW CD ROM 
consistent with the present invention; 

FIG. 6 illustrates a flow diagram for a software agent 
advertising insertion consistent with the present invention; 

FIG. 7 illustrates a flow diagram for automated profile 
generation consistent with the present invention; 

FIG. 8 illustrates a flow diagram of lead generation 
consistent with the present invention; and 

FIG. 9 illustrates a flow diagram for an unmet need agent 
consistent with the present invention. 

DETAILED DESCRIPTION 

In the embodiments described herein and accompanying 
figures, a travel information scenario is depicted. It will be 
understood that the present invention is capable of perform- 
ing similarly for other venues, such as mortgages, automo- 
bile sales and any other interactive exchange of information 
sought by information content seekers and potentially sat- 
isfied by information content providers. 

Initial Setup For User 

Referring to the drawings in detail, FIG. 1 illustrates a 
diagram showing the interface of the present system 200 
with a user on a user access system 100 and various data 
sources. FIG. 2 illustrates several of the datastore categories.
The use of the present invention has at least five phases: 

Initial Setup For User 

Initial Setup For Advertisers and Lead Generation 

Ongoing Maintenance 

User Session 

Post Session Activity 

A theme or definition of a class of information (e.g.,
central California travel and tourism or new automobiles) is
identified. Data sources (Local DataStores (500 . . . N) and
Network Accessible DataStores (300 . . . N)) are screened
for relevance, quality of information and appropriateness (or
may be included de facto based on their title or description).
These are indexed using a text indexing software tool 2981
and the indices stored on the system index DataStore 220.
An initial set of Preestablished Software Text Agents is
defined. These agents are words or combinations of words
that form a word based search pattern. This initial set of
agents is relevant to the searches that might be performed
against the class of information that was indexed (i.e.,
agents about automobiles would be developed to search a
class of indexed information about new cars). These are
stored in the Preestablished Software Text Agent DataStore
231. The System 200 uses any multipurpose computer
central processing unit with the ability to handle multiple
inputs and outputs, with the necessary hard disk storage, and
able to run World Wide Web (WWW) or other network server
software.
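
The relationship between the index and a word based agent can be sketched
with a simple inverted index. This is an illustration under assumptions
only: the text indexing software tool 2981 is not specified, so a plain
word-to-address mapping stands in for it, and the addresses and function
names are hypothetical.

```python
import re
from collections import defaultdict


def build_index(documents):
    """Map each word to the set of data source addresses containing it.

    documents is a dict of address -> text for the screened sources.
    """
    index = defaultdict(set)
    for address, text in documents.items():
        for word in set(re.findall(r"[a-z0-9]+", text.lower())):
            index[word].add(address)
    return index


def run_agent(index, agent_words):
    """Return the addresses that contain every word of the agent."""
    sets = [index.get(word.lower(), set()) for word in agent_words]
    return set.intersection(*sets) if sets else set()
```

An agent such as `["new", "cars"]` then selects only the sources that contain
both words.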

FIG. 1A illustrates a modified arrangement of the inter- 
face of the present system 200 with alternate user access 
systems and alternate network interface devices. 

The present system 200 is in communication with a
limited bandwidth limited character set system (LBLCS)
289 which is a subsystem of input/output system 280.

Although today's WWW access is normally over broadband,
high speed networks, many corporate intranets operate
on limited capability, slow speed networks. The LBLCS
system 289 allows conversion of the rich media used on
today's WWW into text-only media with multimedia
references as anchors that preserve the essential information to
be passed in HTML or other tagged reference format to the
user. For users with limited bandwidth limited character set
networks, the WWW datastore information which is
returned to the user is altered. Any graphics files are
identified, eliminated and replaced with a text anchor. For
example, certain networks or user access systems cannot
handle graphics files. A text page which is returned to the
user 110 or 120 and which contains graphic files will be
identified. The graphic file itself will be eliminated and in its
place a text reference, such as "(picture)", is inserted.
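
The graphics replacement described above can be sketched as a small text
filter. This is a minimal sketch, assuming the page is HTML and that
graphics appear as `<img>` tags. The ASCII fallback for the limited
character set is also an assumption, since the specification does not name
the target character set.

```python
import re

# Matches an HTML image tag in any letter case.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)


def strip_graphics(html):
    """Replace each graphic with the text anchor "(picture)"."""
    return IMG_TAG.sub("(picture)", html)


def to_limited_charset(text):
    """Assumed fallback: reduce text to ASCII, using "?" elsewhere."""
    return text.encode("ascii", errors="replace").decode("ascii")
```

A regular expression is enough for a sketch; a production filter would more
likely use a real HTML parser.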

User access system 110 is connected through a limited
private network to the LBLCS 289 subsystem. FIG. 1B
illustrates a block diagram of the LBLCS subsystem.

User interface system 120 illustrates a connection through 
a limited dial network into the LBLCS subsystem 289. 

The return datastream from the datastores to the user is 
modified to fit the bandwidth, character set and display 
limitations of the network and of the user access device. 

In one implementation of the present system, terminals
for travel agents may be provided with access to the system
200. In certain cases, travel agent terminals are much more
limited than ordinary personal computer CPUs. Through
the use of the present invention, agents will be provided
access to the information aggregation and synthesization
system 200.
Initial Setup For Advertisers and Lead Generation
Advertisers 

Advertisers, using a user access system 100, enter criteria
that should be met for an advertisement/coupon placement.
These criteria are in the form of the complex software text
search agents described above. This includes a match
"threshold". When this threshold is met or exceeded, an
ad/coupon will be appended to a system session. Statistical
analysis known as clustering is used to evaluate the data.
The ad/coupon may be resident on the user access system
100, an advertiser's computer system (400 . . . N) or stored
in the Advertising DataStore 250. Additionally, the
Advertiser may include conditional criteria for ad/coupon
placement (available inventory, in stock levels, excess capacity,
etc.). These criteria are referenced when the "threshold" is met
and, if satisfactory, the ad/coupon is appended. These criteria
may be tested against data input through the user access
system 100, data on the advertising DataStore 250 or data on
the advertiser's computer system (400 . . . N). Additionally,
advertisers can input World Wide Web (WWW) referential
information (hot links) to be displayed with ads/coupons or
on geographic map displays. These are stored on the
advertising DataStore 250.
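
The two-stage test described above (match threshold first, conditional
criteria second) can be sketched as follows. This is an illustration under
assumptions: the fraction of agent terms found in the session text stands in
for the clustering analysis, which the specification does not detail, and
the dictionary keys `terms`, `threshold`, `item` and `min_stock` are
hypothetical.

```python
def match_score(agent_terms, session_text):
    """Fraction of the agent's terms that occur in the session text."""
    words = set(session_text.lower().split())
    terms = [term.lower() for term in agent_terms]
    if not terms:
        return 0.0
    return sum(term in words for term in terms) / len(terms)


def should_place_ad(ad, session_text, inventory):
    """Apply the match threshold, then the conditional stock criterion."""
    if match_score(ad["terms"], session_text) < ad["threshold"]:
        return False
    # Conditional criterion: place the ad only while the item is in stock.
    return inventory.get(ad["item"], 0) >= ad.get("min_stock", 1)
```

Splitting on whitespace ignores punctuation handling, which a fuller text
agent would need to address.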
Lead Generation 

Lead Purchasers, using a user access system 100, enter
criteria that should be met for the generation of a lead. These
criteria are in the form of the complex software text search
agents described above. This includes a match "threshold".
When this threshold is met or exceeded, information about
the current user and the information being viewed is stored
in the lead DataStore 270 for variable output transmission to
the lead purchaser.
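
Lead capture follows the same threshold idea, with a stored record as the
outcome. The sketch below assumes a simple term-overlap score and a list
standing in for the lead DataStore 270. The record fields and function name
are hypothetical.

```python
def capture_leads(lead_agents, user_id, viewed_text, lead_store):
    """Append a lead record for every agent whose threshold is met."""
    words = set(viewed_text.lower().split())
    for agent in lead_agents:
        terms = [term.lower() for term in agent["terms"]]
        if not terms:
            continue  # an agent with no terms can never match
        score = sum(term in words for term in terms) / len(terms)
        if score >= agent["threshold"]:
            lead_store.append({
                "purchaser": agent["purchaser"],
                "user": user_id,
                "viewed": viewed_text,
                "score": score,
            })
    return lead_store
```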

Ongoing Maintenance 

Index Updating 

Local DataStores (500 . . . N) and network accessible
DataStores (300 . . . N) will change randomly and will
become out of synchronization with the system index
DataStore 220. The data monitoring system 2982 will periodically
monitor local DataStores (500 . . . N) and network accessible
DataStores (300 . . . N) and when there is a change, update
the index DataStore 220.
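
Periodic monitoring of this kind is often done by comparing content
fingerprints between polls. The sketch below assumes that approach, which
the specification does not prescribe. The `fetch` and `reindex` callables
and the fingerprint table are hypothetical stand-ins for the data
monitoring system 2982 and the indexing tool.

```python
import hashlib


def fingerprint(content):
    """Digest of a data source's current content."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def refresh_index(fetch, addresses, fingerprints, reindex):
    """Poll each address and re-index those whose content has changed.

    fetch(address) returns the current text of the source,
    fingerprints maps address -> digest seen at the last poll, and
    reindex(address) updates the index for one source.
    Returns the list of addresses that were re-indexed.
    """
    changed = []
    for address in addresses:
        digest = fingerprint(fetch(address))
        if fingerprints.get(address) != digest:
            fingerprints[address] = digest
            reindex(address)
            changed.append(address)
    return changed
```

On the first poll every source counts as changed; later polls touch only the
sources whose content differs.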
Data Addition 

Operators will add data to the local DataStores (500 . . .
N) and users using a user access system 100 will nominate
data from the network accessible DataStores (300 . . . N) to
be added to the index DataStore 220. Operators will update
the indices using the data indexing service 2981 if the data
passes the screening outlined in the initial setup for users
above.

User Session

Login and Profiles 

Browsing 

Data Retrieval 
User Interrupt

Ad/Coupon Insertion 

Persistent Agents 
Login and Profiles 

Users using a user access system 100 access the
information aggregation and synthesization system 200 through
the Internet or other public or private network. The user
either logs in by name or by pseudonym or from data
previously stored in the user access system 100. New users
create an account on the user profile DataStore 210. Previous
users are identified to an existing account. The user is
presented with a variety of options to create or update profile
information in the user profile DataStore 210. This involves
a single data entry option or many mini-options based on the
browsing activity.

Browsing 

The user is also presented with browsing options based 
on: activity from a previous session in the browsing activity 
DataStore 240; predeveloped software text agents and per- 
sonalized software text agents (developed in the Post Ses- 
sion Activity) stored in the Personal Search Text Agent 
DataStore 232; or combinations of all as well as situational 
opportunities developed by the user greeting subsystem 291. 
The user selects the search options to be used (or simply 
enters search criteria directly). These search criteria are used to
search the index DataStore 220 and a list of data sources is 
presented to the user for selection. The user indicates the 
information to be viewed. The user will also be presented 
with options to refine his search through the altering of 
search agent criteria (Search Reduction System 293). 
Data Retrieval 

The requested data is retrieved either from local
DataStores (500 . . . N) or network accessible DataStores (300 . . . N)
and presented to the user via the session management
system 292. The user may jump to data referenced in the 
presented data. Subject to the appropriate policies of the site 
operation, the session management system 292 will further 
retrieve and present this data to the user. The user may 
request that data be overlaid on a geographic display using 
the Geographic Display I/O System 287 so that referenced 
information may have geographic relevance. 
User Interrupt 

The user interrupt system 294 will periodically notify the 
user of specialized software text agents that they may want 
to pursue. These Agents are stored in the agent DataStore 
230 and are derived by the real time session analysis system 
295 which monitors the browsing activity DataStore 240 
during the user's session. 
Ad/Coupon Insertion 

During the session, ads/coupons are inserted alongside 
displayed data (text, picture or index displays) from the ad 
DataStore 250, based on ad/coupon insertion agents 233 and 
inserted by the session management system 292. A Record 
of Insertion along with appropriate user information (may be 
general or precise to the name of the user) is stored in the 
advertising activity DataStore 260. 
Persistent Agents

At any time, the user may establish a persistent software
Text Agent (using the persistent agent entry system 297,
stored in the unmet needs agent DataStore 234) with criteria
that, if met sometime in the future, will cause the user to be
notified through the I/O System 280. These can be explicit
or implicit query parameters.
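
A persistent agent can be pictured as a stored query that is re-run whenever
new information is added. The following is a minimal sketch under
assumptions: each agent is a dictionary with hypothetical `user`, `terms`
and `channel` fields, and `notify` is a caller-supplied callable standing in
for the I/O System 280.

```python
def check_persistent_agents(agents, new_item_text, notify):
    """Notify each user whose stored agent matches a newly added item.

    An agent matches when all of its terms appear in the new item.
    Returns the list of users that were notified.
    """
    words = set(new_item_text.lower().split())
    fired = []
    for agent in agents:
        if all(term.lower() in words for term in agent["terms"]):
            notify(agent["user"], agent["channel"], new_item_text)
            fired.append(agent["user"])
    return fired
```

Agents that never fire remain in the store, which is what makes them useful
later as a record of unmet needs.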

Post Session Activity 

Periodically, either due to a preset time interrupt, or user 
or advertiser event driven activity, the following can occur: 

Unmet Needs Analysis 

Advertising Report 

Profile Updating 

Lead Report 

Targeted Output 

Output Activity 
Unmet Needs Analysis 

Users using the user access system 100 will be able to 
establish persistent (stays in the system after the user quits 
using the system) software text agents which describe some 
criteria, which, if met, will cause them to be notified. These 
are stored in the unmet needs agent DataStore 234. These
unmet needs agents 234 are analyzed using the unmet needs 
analysis system 299 and reports are created through the I/O 
System 280 for suppliers who could potentially meet those 
needs. 

Advertising Report 

Information about each Ad/Coupon appended to an infor- 
mation aggregation and synthesization system along with 
known information about the user is stored in the advertising 
activity DataStore 260. This is reported out periodically to 
the advertisers/couponers using the I/O System 280. 
Profile Updating 

During a session or after a user discontinues use, the data
viewed (recorded in the browsing activity DataStore 240) is
analyzed by the session profile update 2921 and the user
profile DataStore 210 is updated with keywords or personal
search text agents 232.
Lead Report 

Periodically, the Software Text Lead Agents stored in the 
lead generation agent DataStore 235 are used to analyze the 
data viewed (recorded in the browsing activity DataStore 
240) and reports prepared for lead purchasers using the I/O 
System 280. 
Targeted Output 

Users through the user input system 100 will be able to 
designate information to be output and the format that the 
I/O System 280 will use. 
Output Activity (Using the I/O System 280) 

All output systems will provide for the addition of 
specials, ads and/or coupons. 
Options are 

Personalized Page 281 — This will create a page acces- 
sible through the WWW where the user can access requested 
information. 

SMTP Electronic Mail 282 — This allows the delivery of 
user requested information using the SMTP capability of the 
Internet and other popular electronic mail systems. 

CCITT Class 3 or Class 4 Facsimile 283— This allows 
user requested data to be formed as a printed page and sent 
via Fax to a Fax receiver of the user's choice. 

Voice output direct or to a Voice Mail Box 284 — This 
translates the user requested data to audio, connects to the 
user or their voice mail system and transmits the audio. 

Personal TV or video feed 285 — This formats the data in 
a form compatible with transmitted video and allows viewing
on demand.

Custom CD-ROM 286 — This places the requested data, 
indices, viewers and all necessary software on a user-unique
CD-ROM for physical delivery.

Geographic Display I/O System 287 — This allows the 
user to view content geographically, to look at the geo- 
graphic proximity of merchants and services and provides a 
vehicle for ads and hot links. 

Mobile/Portable System 288 — This allows Specially for- 
matted Genie Information to be displayed or translated for a 
wide variety of mobile and portable devices. 
Identification of Key System Components by Reference 
Numerals 

100 User Access System 

110 Limited private network user access system 
120 Limited dial network user access system 
200 System comprised of 
210 User Profile DataStore 
220 Travel Genie Index DataStore 
230 Agent DataStore 

231 Preestablished Software Text Agents 

232 Personal Search Text Agents 

233 Ad/Coupon Insertion Agents

234 Unmet Need Agents

235 Lead Generation Agents 
240 Browsing Activity DataStore 
250 Advertising DataStore 

260 Advertising Activity DataStore 
270 Lead DataStore 
280 I/O System 

281 Personalized Page Output System 

282 SMTP Electronic Mail System 

283 CCITT Class 3 or Class 4 Facsimile 

284 Voice Output 

285 Personal TV or Video Feed 

286 Custom CD-ROM 

287 Geographic Display I/O System 

288 Mobile/Portable Device System 

289 Limited Bandwidth Limited Character Set System 
290 Operations System 

291 User Greeting System 

292 Travel Genie Session Management System 
2921 Session Profile Update 

293 Search Reduction System 

294 User Interrupt System 

295 Real Time Session Analysis System 

296 Ad/Coupon Insertion System 

297 Persistent Agent Entry System 

298 Data Support Systems 

2981 Data Indexing Service 

2982 Data Monitoring System 

299 Unmet Needs Analysis System 
300 Network Accessible DataStores 

301 ... N 
400 Advertiser's Computer Systems 

401 ... N 
500 Local DataStores 

501 ... N 
100 User Access System 

This is a network addressable interface device, such as a 
conventional personal computer capable of initiating and 
maintaining a network connection and sending, receiving 
and displaying data including a digitized data visual repre- 
sentation device such as a monitor and auxiliary storage, 
such as a floppy disk drive. It may also be a TV set, smart 
telephone or network appliance with similar capabilities. It 
will maintain a connection through a modem (a modulator/ 
demodulator device) that enables data to be transmitted and 
received. 
200 DataStores 

FIG. 2 illustrates DataStores utilized as a part of the 
invention. The information aggregation and synthesization 
system includes: 
210 User Profile DataStore 

This contains data about the user, preferences, situational 
preferences, accounting information, psychographic profile, 
personal profile and other relevant information related to the 
user by individual identifier. 
220 System Index DataStore 

This is the index of data accessible by the system. 

It is established initially and updated as data changes or 
new data sources are added. It is queried by Agents from the 
Agent DataStore 230 or by key words. 
230 Agent DataStore 

231 Preestablished Software Text Agents

These are complex software text search patterns pre- 
defined by the site subject matter experts using their exten- 
sive knowledge of information contained within the site's 
indices. 

232 Personal Search Text Agents 

These are complex software text search patterns that may 
be individual words or word sets and/or combinations of 
words and Preestablished Software Text Agents 231 includ- 
ing the results of the post session analysis 2921 that provide 
individually customized searching of the Index DataStore 
220. 

233 Ad/Coupon Insertion Agents 

These are complex software text search patterns that when 
matched within the text being reviewed within a given 
session, cause an advertisement/coupon to be added into the 
display. Insertion can be direct or conditioned on
criteria from the Advertiser's Computer Systems (400 . . . N)
and/or the user's profile from the user profile DataStore 210.

234 Unmet Need Agents 

These are complex software text search patterns created 
by the user to persist after the end of the user session looking 
for patterns and/or specific events or data that are observed 
within the System 200 at some future time. 

235 Lead Generation Agents 

These are complex software text search patterns that when 
matched within the text being reviewed within a given 
session, cause an addition to the Lead DataStore 270 for
output to the lead purchaser using the I/O System 280. 
240 Browsing Activity DataStore 

This is the record of the "looking" activity of each user in 
each session. 

250 Advertising DataStore 

This is the storehouse of ads to be presented when a match 
is made by the Ad/Coupon Insertion Agent 233.
260 Advertising Activity DataStore 

This is the record of ads presented by the Ad/Coupon
Insertion System 296 and information about the user seeing
the ads from the Browsing Activity DataStore 240 and the
user profile DataStore 210.
270 Lead DataStore 

When a Lead Generation Agent 235 makes a match, Data 
about the user from the user profile DataStore 210 and the 
Browsing Activity DataStore 240 is stored here. 
280 I/O System 

These are the various ways that output can be channeled, 
for the user, the advertiser or the lead purchaser. 

281 Personalized Page Output System 

This allows output text and associated objects to be 
formatted for general or selective viewing through any 
system using Hypertext Markup Language (HTML), VRML 
(Virtual Reality Modeling Language) or other network com- 
patible display based language either locally or over a 
network. 

282 SMTP Electronic Mail System 

This allows output text for whatever purpose to be for- 
matted in a format compatible with the SMTP (Simple Mail 
Transport Protocol) and transmitted to a designated 
addressee.

283 CCITT Class 3 or Class 4 Facsimile 

This allows output text and associated objects for what- 
ever purpose to be formatted to be compatible with the 
CCITT Class 3 or Class 4 Fax standard and transmitted to a 
designated fax receiver. 

284 Voice Output 

This allows output text for whatever purpose to be for- 
matted into voice for transmission to a human receiver or a 
voice mail box.

285 Personal TV or Video Feed

This allows output text and associated objects for what- 
ever purpose to be formatted as a TV signal (any interna- 
tional standard) to be accessed and replayed using local or 
network capability at the request of an individual user (or a 
class of users). 

286 Custom CD-ROM 

This allows the user to designate certain data to be placed 
onto a CD-ROM along with all necessary search and view- 
ing software as well as non user requested ads and promo- 
tions. 

287 Geographic Display I/O System 

This allows data requested by the user to be overlaid on 
a geographic reference system (a map). 

288 Mobile/Portable Device System

This allows output to be formatted for a variety of devices 
including but not limited to: pagers, personal digital 
assistants, mobile computing devices and other wireless 
devices. 

289 Limited Bandwidth, Limited Character Set (LBLCS) 
Data Network 

The software module input/output system identifies 
graphic files, removes them and replaces them with text 
anchors. The LBLCS module may be resident on the I/O 
system 280 or be established on separate hardware. 
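
By way of illustration only, the graphic replacement described for the LBLCS module may be sketched as follows. The sketch assumes that the returned data is HTML, that graphics appear as `img` tags, and that the text anchor is taken from the `alt` attribute when one is present; the function names `replace_graphics` and `to_limited_charset`, the bracketed anchor format and the ASCII-only character set are hypothetical choices and are not specified by the present description.

```python
import re

# Assumption: graphics are referenced through <img ...> tags in HTML data.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)
ALT_ATTR = re.compile(r'alt\s*=\s*"([^"]*)"', re.IGNORECASE)


def replace_graphics(html: str) -> str:
    """Replace each graphic reference with a bracketed text anchor."""
    def to_anchor(match: re.Match) -> str:
        alt = ALT_ATTR.search(match.group(0))
        label = alt.group(1).strip() if alt else ""
        # Fall back to a generic label when no alt text is available.
        return f"[{label or 'IMAGE'}]"

    return IMG_TAG.sub(to_anchor, html)


def to_limited_charset(text: str) -> str:
    """Reduce text to ASCII, substituting '?' for unsupported characters."""
    return text.encode("ascii", "replace").decode("ascii")
```

A device limited to a small character set would receive the output of `to_limited_charset(replace_graphics(page))`, where `page` is the data returned from the DataStore.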
290 Operations System 

291 User Greeting System 

This is the subsystem that identifies users, customizes 
search screens, incrementally collects explicit profile infor- 
mation and formulates search agent screens and search 
specials which may be situational or seasonal or both. 

292 Session Management System 

This tracks and records a user's browsing activity, sets ID 
tokens, establishes accounts, translates anonymous users to 
named users and manages the user's implicit profile infor- 
mation. 

2921 Session Profile Update 

This uses the Browsing Activity DataStore 240 records to
analyze and update the user's profile in the user profile
DataStore 210.

293 Search Reduction System 

This aids the search by suggesting changes to the complex 
software text search agents to refine the user's search. 

294 User Interrupt System 

Based on the Real Time Session Analysis 295 of the user's
looking activity (stored in 240), this determines associated
references, agents or other information to be offered to the
user and interrupts the user's session with an interactive data
screen.

295 Real Time Session Analysis System

This monitors the user's browsing activity and analyzes 
the apparent interests to trigger the user interrupt system 
294. 

296 Ad/Coupon Insertion System 

This looks at the current display requested by the user 
with an Ad/Coupon Insertion Agent 233, determines which
ads should be placed (or rotated) and makes the placement 
(or establishes the rotation). 

297 Persistent Agent Entry System 

This is the mechanism whereby the user enters the Unmet 
Need Agent 234. This agent monitors text and data changes 
and if the requested data/pattern occurs, the user is notified
via the I/O System 280. 

298 Data Support Systems 
2981 Data Indexing Service 

This is the facility that indexes designated DataStores
(either Network Accessible DataStores (300 . . . N) or Local
DataStores (500 . . . N)) upon operator input or periodically
and stores these indices in the Index DataStore 220.
2982 Data Monitoring System 

This facility, periodically or on demand, checks indices
stored in the Index DataStore 220 against actual data (either
Network Accessible DataStores (300 . . . N) or Local
DataStores (500 . . . N)) and, if the data has changed, queues
the change for operator review or updates the indices.
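
As a non-limiting sketch, the change check performed by the data monitoring system may be modeled by comparing a stored fingerprint of each indexed source against a fingerprint of the data as currently retrieved. The use of a SHA-256 digest, the dictionary layout and the names `fingerprint`, `find_changed` and `monitor` are assumptions made for illustration; the present description does not specify how a change is detected.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """Return a digest that stands in for the stored index entry."""
    return hashlib.sha256(content).hexdigest()


def find_changed(indexed: dict, current: dict) -> list:
    """List addresses whose current data no longer matches the index.

    indexed maps address -> stored fingerprint;
    current maps address -> data as retrieved now (bytes).
    """
    changed = []
    for address, content in current.items():
        if indexed.get(address) != fingerprint(content):
            changed.append(address)
    return sorted(changed)


def monitor(indexed: dict, current: dict, auto_update: bool = False) -> list:
    """Either update the index in place or return a queue for operator review."""
    changed = find_changed(indexed, current)
    if auto_update:
        for address in changed:
            indexed[address] = fingerprint(current[address])
        return []
    return changed
```

With `auto_update` left false, the returned list corresponds to the queue presented to the operator; with it set true, the indices are refreshed without review.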

299 Unmet Needs Analysis System 

This analyzes the persistent agents for common patterns
or specific requests that can be custom tailored. The results
are output through the I/O System 280.

300 Network Accessible DataStores 
301 ... N 

These are an infinite number of network data sources that
are included in the scope of the information aggregation and
synthesization. These are represented by (300 . . . N).
400 Advertiser's Computer Systems 

401 ... N 

These are DataStores established by advertisers to store
ads/coupons to be presented or to set additional conditions
for display.
500 Local DataStores 
501 ... N 

These are similar to the 300 series but are locally accessible
rather than accessible over a wide area network.

Each of the six aspects of the present invention will be
discussed in detail.

I. URL Munging 
The present invention becomes a gateway to network data
content provided by others. The present invention directs
access which is controlled through an intermediary gateway
system.

The user, through a network addressable interface device
such as the user access system 100, will connect with a local
or network accessible DataStore. The user will select a page
(designated by a Uniform Resource Locator or URL) to be
used. The URL will be modified or "munged" so that
retrieval must go through the present invention when the
user executes a retrieval request. This then permits return of
requested data to the user from the DataStore, at all times
passing through the present invention 200.

The URLs embedded in each page that pass through are
indexed by the present invention or "munged" so that any
hyper linking to another WWW site always goes through the
present invention. As an example, "WWW.anywhere.com"
is converted to "WWW.travelgenie.com?WWW.anywhere.com",
even though the user will see a direct path to the distant site.

Accordingly, when the user clicks on a URL (or types it
in a browser's search request), the user will connect to the
requested site through the system 200.
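
By way of example only, the URL modification described above may be sketched as follows. The sketch assumes a gateway address of the form given in the example, percent-encodes the original URL before appending it (an assumption added here so that the munged address remains unambiguous; the example above simply concatenates the two addresses), and rewrites only double-quoted `href` attributes. The names `munge_url`, `unmunge_url` and `munge_page` are hypothetical.

```python
import re
from urllib.parse import quote, unquote

# Hypothetical gateway address, modeled on the example in the text.
GATEWAY = "http://www.travelgenie.com"

# Assumption: links appear as double-quoted href attributes.
HREF = re.compile(r'(href\s*=\s*")([^"]+)(")', re.IGNORECASE)


def munge_url(url: str, gateway: str = GATEWAY) -> str:
    """Rewrite a URL so that retrieval passes through the gateway."""
    return f"{gateway}?{quote(url, safe='')}"


def unmunge_url(munged: str, gateway: str = GATEWAY) -> str:
    """Recover the original URL at the gateway before retrieval."""
    prefix = gateway + "?"
    if not munged.startswith(prefix):
        raise ValueError("not a munged URL")
    return unquote(munged[len(prefix):])


def munge_page(html: str, gateway: str = GATEWAY) -> str:
    """Munge every link in a page, leaving already-munged links alone."""
    def rewrite(match: re.Match) -> str:
        target = match.group(2)
        if target.startswith(gateway + "?"):
            return match.group(0)
        return match.group(1) + munge_url(target, gateway) + match.group(3)

    return HREF.sub(rewrite, html)
```

Because `munge_page` skips links that already point at the gateway, a page that passes through the system more than once is not munged twice.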

The present invention may be utilized with a wide variety
of network addressable interface devices. When the invention
is utilized on a limited bandwidth, limited character set
data network, the datastream returned to the user will pass
through the LBLCS network 289. The datastream is modified
to fit the bandwidth, character set and display limitations
of the network and the limitations of the user access device.

II. WWW - CD-ROMs

The user of a network addressable interface device will
select World Wide Web (WWW) data content for retrieval
using a search engine to return selected WWW references.
The user will then select and designate certain of these
references to be included in a custom CD-ROM which will
be burned or recorded onto a compact disc and then sent by
express delivery to the user.

The user will designate pages and other WWW data
content including but not limited to HTML files, audio files, 
still images and other graphic files from the WWW. Through 
the session management system 292, selected material will 
be designated and retrieved. The retrieved data will be 
included in a custom CD-ROM produced by a service 
bureau and then sent by a delivery service to the user. FIG. 
5 shows a process flow diagram. 

Optionally, the designated data may be communicated to 
the user via automated telephone means, may be commu- 
nicated to a user via electronic replication, or may be copied 
on to auxiliary computer storage such as through a floppy 
disk drive. 

III. Software Agent Advertising Information 
Advertising is provided which benefits the user while
optimizing the advertiser's expenditure by only presenting
ads or coupons (or ads and coupons in a rotation if multiple 
ads/coupons qualify) that are pertinent to that particular user. 

Certain criteria will be entered which delineate a pattern
that is requested to be monitored. When this pattern is seen 
(or is in close match) in the user's WWW activity, the 
insertion mechanism is activated. If a certain web page is 
requested, the present invention will display a particular 
advertisement. The ad will be inserted based on the content 
of the existing web page being read. An analysis of the text 
stream of the user's interactive session will be performed 
online. When certain text patterns are observed (or close 
matches are observed), an advertisement is inserted into the 
display. 

The advertising may be static or connected to the adver- 
tiser's computer DataStore which designates specific ads or 
coupons based on the pattern match and other conditions 
which may be required. 

FIG. 6 illustrates a flow diagram for the software agent 
advertising insertion. 

The software agent criteria are entered by the merchant in
the agent data store 230 and delineate a pattern that needs
to be monitored.

As an example, if the user accesses web pages for 
"Holiday Inns on the West Coast", the insertion mechanism 
would be established to automatically insert ads for "Hilton 
Inns on the West Coast". 
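
The insertion mechanism described above may be sketched, purely for illustration, as a threshold match between the text being viewed and each agent's criteria, with rotation when more than one ad qualifies. Simple keyword overlap stands in here for the complex software text search patterns; the names `AdAgent`, `match_score` and `AdInserter`, and the definition of the threshold as the fraction of keywords seen, are assumptions and not part of the present description.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class AdAgent:
    keywords: frozenset      # pattern entered by the advertiser
    ad: str                  # ad or coupon to insert on a match
    threshold: float = 1.0   # fraction of keywords that must be seen


def tokenize(text: str) -> set:
    """Lower-case the text and split it into a set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def match_score(agent: AdAgent, text: str) -> float:
    """Fraction of the agent's keywords present in the text."""
    if not agent.keywords:
        return 0.0
    return len(agent.keywords & tokenize(text)) / len(agent.keywords)


class AdInserter:
    """Selects an ad for each page, rotating among qualifying ads."""

    def __init__(self, agents):
        self.agents = list(agents)
        self._turn = 0

    def select(self, page_text: str):
        qualifying = [a for a in self.agents
                      if match_score(a, page_text) >= a.threshold]
        if not qualifying:
            return None
        ad = qualifying[self._turn % len(qualifying)].ad
        self._turn += 1
        return ad
```

A threshold below 1.0 corresponds to the close match mentioned above: the ad is inserted even when only part of the pattern is observed.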

IV. Automated Profile Generation 

Browsing patterns of the user are analyzed and these 
patterns update profiles automatically. 

FIG. 7 illustrates a flow diagram for the Automated Profile 
Generation. 

The looking patterns of the user are monitored to develop 
a set of software text agent profiles that are integrated with 
explicitly collected profile information to assist the user in 
narrowing down information for future sessions as well as 
suggesting references, merchandise or services during the 
current session. This is accomplished by statistical analysis 
of the text stream. 

The searching patterns of the user on the Internet are 
monitored by monitoring the text stream. A set of software 
text agent profiles is developed and may be integrated with 
explicitly collected profile information. The explicit infor- 
mation is gathered by queries to the user. The explicit and 
implicit data are merged to develop software text agents that 
support the user's future shopping sessions. 
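
One possible reading of the merging of explicit and implicit data is sketched below, for illustration only. Word frequency over the pages viewed stands in for the statistical analysis of the text stream; the stop-word list, the fixed weight given to explicitly stated interests and the names `implicit_interests` and `merge_profile` are assumptions introduced for the sketch.

```python
import re
from collections import Counter

# Assumption: a small stop-word list is enough for the illustration.
STOPWORDS = frozenset({"the", "a", "an", "and", "or", "of", "in",
                       "on", "to", "for", "is", "at", "with"})


def implicit_interests(pages, top_n=5):
    """Return the most frequent terms in the pages a user has viewed."""
    counts = Counter()
    for page in pages:
        counts.update(w for w in re.findall(r"[a-z]+", page.lower())
                      if w not in STOPWORDS)
    return counts.most_common(top_n)


def merge_profile(explicit, implicit, explicit_weight=10):
    """Combine stated interests with observed ones into one weighted profile."""
    profile = Counter({term: explicit_weight for term in explicit})
    for term, count in implicit:
        profile[term] += count
    return dict(profile)
```

The merged weights could then seed the personal search text agents 232 used in a later session.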

During a user's session, advanced text analysis tools are 
used in real time to understand the interests of the user by 
synthesis of the text pages looked at. This synthesis is used 
as input for statistical correlation with similar interests of a 
larger population. The results of this correlation are used to 
predict extended interests of the user. These are matched
using intelligent software text agents and a variety of
reasoning techniques including case based reasoning and
fuzzy logic to establish a recommended list of search ideas,
promotions and specials. Collaborative filtering may also
be employed. As an example, if the text analysis
indicates that the user has looked at downhill and cross-country
skiing, past usages from a larger population may
indicate that the user will also be interested in ice skating.

As seen in FIG. 7, real time analysis of data is illustrated
at box 295. The real time session analysis is in communication
with the user interrupt system 294 so that the session
may be interrupted at an appropriate point. At the same time,
a post session profile update 2921 will update profiles based
on browsing activity from a past session, which are thereafter
stored in user profile DataStore 210.
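
The skiing example may be illustrated with a minimal collaborative filtering sketch. It assumes that each member of the larger population is represented as a set of interests and that candidate interests are scored by the overlap between that member and the current user; the function name `predict_interests` and the scoring rule are hypothetical and are only one of many ways such a correlation could be computed.

```python
from collections import Counter


def predict_interests(user_interests, population, top_n=3):
    """Suggest interests held by others who share interests with the user."""
    user = set(user_interests)
    scores = Counter()
    for other in population:
        other = set(other)
        overlap = len(user & other)
        if overlap == 0:
            continue  # no shared interests, so no evidence either way
        for item in other - user:
            # Weight each candidate by how similar this person is to the user.
            scores[item] += overlap
    return [item for item, _ in scores.most_common(top_n)]
```

Under this sketch, a user who has looked at downhill and cross-country skiing is offered ice skating when others with the same interests also looked at ice skating.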
V. Automated Lead Generation 

It is known that suppliers will pay for information gathered
about users' specific interests. When tied to a specific
user, these become "leads" that a supplier can use for off-line
follow up. The automated lead generation aspect will analyze
a user's profile and session looking activity against a
profile established by a supplier. When this profile is
approximately matched, the supplier is notified so it can
contact the user to offer goods or services. Statistical analysis
using complex software text agents is used to determine
the match.

FIG. 8 illustrates a flow diagram of the lead generation.

In the present invention, the user's WWW viewing patterns
are monitored. These and optionally the user's profile
210 are matched against software text agents entered by a
supplier in an agent DataStore 230. When these agents
match a pattern or profile, the supplier is notified.
Additionally, when this profile is approximately matched,
the supplier is notified. Lead purchasers, using a user access
system 100, will enter criteria that should be met for the
generation of a lead. These criteria are in the form of
complex software text search agents. When this threshold is
met or exceeded, information is stored in the lead DataStore
270 for variable output transmission to a lead purchaser.
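
The threshold test for lead generation may be sketched as follows, by way of illustration only. The sketch assumes that a lead purchaser's criteria reduce to a set of keywords with a required match fraction, and that the evidence for a user is the union of profile terms and words from the pages viewed; the names `LeadAgent` and `generate_leads` and the record layout are hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class LeadAgent:
    purchaser: str        # lead purchaser who entered the criteria
    keywords: frozenset   # criteria, simplified to keywords
    threshold: float      # fraction of keywords that must be matched


def words(text: str) -> set:
    """Lower-case the text and split it into a set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def generate_leads(user_id, viewed_pages, profile_terms, agents):
    """Return one lead record per agent whose threshold is met or exceeded."""
    evidence = {term.lower() for term in profile_terms}
    for page in viewed_pages:
        evidence |= words(page)
    leads = []
    for agent in agents:
        if not agent.keywords:
            continue
        score = len(agent.keywords & evidence) / len(agent.keywords)
        if score >= agent.threshold:
            leads.append({"purchaser": agent.purchaser,
                          "user": user_id,
                          "score": score})
    return leads
```

Each returned record corresponds to an entry that would be stored in the lead DataStore 270 for later output to the lead purchaser.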

VI. Software Agent Unmet Needs Generation

In the present invention, records will be maintained from 
user usage of the Internet and other networks on what 
consumer queries are unmet by the WWW content retrieved. 
FIG. 9 illustrates a flow diagram. 

If the user does not find what they are looking for, a
"watcher" agent may be set up to advise them if the object
of their search occurs at some future time. An example
would be a tour, a price or some other information. Through
the session management system 292 a threshold will be
established on the user need.

The invention will intuitively construct a profile from user
inputted data. This will be done by recognizing unmet or
unanswered queries and/or user initiated requests. From this,
a profile will be developed to identify new markets. The
system will thus be able to gather "negative" leads. This
information may be extracted and sold to suppliers who will
build new products and services and then use the system as
a mechanism to notify the potential customer.
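
A minimal sketch of the "watcher" agent and of the report of unmet needs is given below for illustration. It assumes that a need is recorded as the set of words in the user's unanswered query, that a need is met when all of those words appear in newly indexed text, and that the supplier report is a count of terms across needs still outstanding; the class `UnmetNeedsStore` and its methods are hypothetical.

```python
import re
from collections import Counter


def words(text: str) -> set:
    """Lower-case the text and split it into a set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


class UnmetNeedsStore:
    """Holds persistent watcher agents until their criteria are met."""

    def __init__(self):
        self._agents = []  # list of (user, frozenset of keywords)

    def add(self, user: str, query: str) -> None:
        """Record an unanswered query as a persistent agent."""
        self._agents.append((user, frozenset(words(query))))

    def check(self, new_text: str) -> list:
        """Return users to notify and retire their satisfied agents."""
        seen = words(new_text)
        notified, remaining = [], []
        for user, keywords in self._agents:
            if keywords and keywords <= seen:
                notified.append(user)
            else:
                remaining.append((user, keywords))
        self._agents = remaining
        return notified

    def supplier_report(self) -> list:
        """Count terms across outstanding needs, most common first."""
        counts = Counter()
        for _, keywords in self._agents:
            counts.update(keywords)
        return counts.most_common()
```

Terms that recur across many outstanding agents indicate the "negative" leads that a supplier might address with a new product or service.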
Whereas, the present invention has been described in
relation to the drawings attached hereto, it should be understood
that other and further modifications, apart from those
shown or suggested herein, may be made within the spirit
and scope of this invention.
What is claimed: 

1. An information aggregation and synthesization process
to retrieve information in a network in which a user operates
a network addressable device, which process comprises:

communicating between said network addressable device
and at least one network accessible datastore through 
network addressing means; 

analyzing of a returned text stream from said network
datastore; and 

retrieval from an advertising datastore and insertion of 
advertising or product discount information in the text 
stream based upon a threshold matching of a predetermined
criteria and said text stream analysis.

2. An information aggregation and synthesization process 
as set forth in claim 1 wherein said network specific address- 
ing means includes Uniform Resource Locators (URLs). 

3. An information aggregation and synthesization process
as set forth in claim 1 wherein said analyzing is performed
through an intermediary gateway system.

4. An information and aggregation and synthesization 
process as set forth in claim 1 including the additional step 
of identifying graphic material in data returned from said 
datastores and replacing said graphic material with a text 
anchor. 

5. An information aggregation and synthesization process 
to retrieve information in a network in which a user operates 
a network addressable interface device, which process
comprises:

communicating between said network addressable interface
device and at least one network datastore through
network addressing means; 

accessing a text stream passing between said network 
addressable interface device and said network datastore 
by an intermediary gateway system; 

retrieval of advertising or product discount information 
from an advertising datastore based upon a threshold 
matching of predetermined criteria and the text stream 
analysis and insertion of the advertising or product 
discount information in said text stream; 

gathering of explicit information from said user and 
gathering of implicit information to develop a user 
profile; 

providing information about said user to a lead purchaser; 
and 

providing information to a third party to meet needs 
identified. 

6. An information aggregation and synthesization process 
as set forth in claim 5 wherein said network specific address- 
ing means includes Uniform Resource Locators (URLs). 

7. An information aggregation and synthesization process 
as set forth in claim 5 wherein said analyzing is performed 
through an intermediary gateway system.